55 research outputs found

    Chromosomer: A Reference-Based Genome Arrangement Tool for Producing Draft Chromosome Sequences

    Get PDF
    Background: As the number of sequenced genomes rapidly increases, chromosome assembly is becoming an even more crucial step of any genome study. Since de novo chromosome assemblies are confounded by repeat-mediated artifacts, reference-assisted assemblies that use comparative inference have become widely used, prompting the development of several reference-assisted assembly programs for prokaryotic and eukaryotic genomes. Findings: We developed Chromosomer – a reference-based genome arrangement tool, which rapidly builds chromosomes from genome contigs or scaffolds using their alignments to a reference genome of a closely related species. Chromosomer does not require mate-pair libraries and it offers a number of auxiliary tools that implement common operations accompanying the genome assembly process. Conclusions: Despite implementing a straightforward alignment-based approach, Chromosomer is a useful tool for genomic analysis of species without chromosome maps. Putative chromosome assemblies by Chromosomer can be used in comparative genomic analysis, genomic variation assessment, potential linkage group inference and other kinds of analysis involving contig or scaffold mapping to a high-quality assembly

    A Moving Landscape for Comparative Genomics in Mammals

    Get PDF
    Today we count some 62,000 species of vertebrates (half are fishes) including some 550 species of mammals on earth. The genome sequencing of non-laboratory species in recent years is expanding our breadth and understanding of genetic bases of adaptation and evolution in varied and amazing ways. Recent completion and inspection of whole genome sequence and assembly for over 200 species of mammals, from platypus to panda to human, offer the prospect of a better view of the patterns of changes within genome organization across the mammalian radiations. In 2009 my colleagues and I have created Genome-10K, an international consortium of scientist who have set a goal of gathering, sequencing, assembling, and annotating to high quality some 10,000 vertebrate genomes with 2nd and 3rd generation sequencing technology within the coming five years. These activities and advances provide an enormous Bioinformatics challenge whose solution will provide future zoologists of every persuasion a genome sequence resource for their favorite study animal. The applications and potential for the genome sequence in several research questions will be discussed

    Draft Genome of Bugula neritina, a Colonial Animal Packing Powerful Symbionts and Potential Medicines

    Get PDF
    Many animal phyla have no representatives within the catalog of whole metazoan genome sequences. This dataset fills in one gap in the genome knowledge of animal phyla with a draft genome of Bugula neritina (phylum Bryozoa). Interest in this species spans ecology and biomedical sciences because B. neritina is the natural source of bioactive compounds called bryostatins. Here we present a draft assembly of the B. neritina genome obtained from PacBio and Illumina HiSeq data, as well as genes and proteins predicted de novo and verified using transcriptome data, along with the functional annotation. These sequences will permit a better understanding of host-symbiont interactions at the genomic level, and also contribute additional phylogenomic markers to evaluate Lophophorate or Lophotrochozoa phylogenetic relationships. The effort also fits well with plans to ultimately sequence all orders of the Metazoa

    Draft de novo Genome Assembly of the Elusive Jaguarundi, Puma yagouaroundi

    Get PDF
    The Puma lineage within the family Felidae consists of 3 species that last shared a common ancestor around 4.9 million years ago. Whole-genome sequences of 2 species from the lineage were previously reported: the cheetah (Acinonyx jubatus) and the mountain lion (Puma concolor). The present report describes a whole-genome assembly of the remaining species, the jaguarundi (Puma yagouaroundi). We sequenced the genome of a male jaguarundi with 10X Genomics linked reads and assembled the whole-genome sequence. The assembled genome contains a series of scaffolds that reach the length of chromosome arms and is similar in scaffold contiguity to the genome assemblies of cheetah and puma, with a contig N50 = 100.2 kbp and a scaffold N50 = 49.27 Mbp. We assessed the assembled sequence of the jaguarundi genome using BUSCO, aligned reads of the sequenced individual and another published female jaguarundi to the assembled genome, annotated protein-coding genes, repeats, genomic variants and their effects with respect to the protein-coding genes, and analyzed differences of the 2 jaguarundis from the reference mitochondrial genome. The jaguarundi genome assembly and its annotation were compared in quality, variants, and features to the previously reported genome assemblies of puma and cheetah. Computational analyzes used in the study were implemented in transparent and reproducible way to allow their further reuse and modification. </p

    El primer genoma mitocondrial completo de Diadema antillarum (Diadematoida, Diadematidae)

    Get PDF
    The mitochondrial genome of the long-spined black sea urchin, Diadema antillarum, was sequenced using Illumina next-generation sequencing technology. The complete mitogenome is 15,708&nbsp;bp in length, containing two rRNA, 22 tRNA and 13 protein-coding genes, plus a noncoding control region of 133&nbsp;bp. The nucleotide composition is 18.37% G, 23.79% C, 26.84% A and 30.99% T. The A + T bias is 57.84%. Phylogenetic analysis based on 12 complete mitochondrial genomes of sea urchins, including four species of the family Diadematidae, supported familial monophyly; however, the two Diadema species, D. antillarum and D. setosum were not recovered as sister taxa.El genoma mitocondrial del erizo de mar negro de espinas largas, Diadema antillarum, se secuenció utilizando la tecnología de secuenciación de nueva generación de Illumina. El mitogenoma completo tiene un tamaño de 15,708 pb, que contiene dos ARNr, 22 ARNt y 13 genes codificadores de proteínas, además de una región de control no codificante de 133 pb. La composición de nucleótidos es 18.37% G, 23.79% C, 26.84% A y 30.99% T. El sesgo A+T es del 57.84%. El análisis filogenético basado en 12 genomas mitocondriales completos de erizos de mar, incluyendo cuatro especies de la familia Diadematidae, apoya la monofilia familiar. Sin embargo, &nbsp;las dos especies de Diadema en este estudio, &nbsp;D. antillarum y D. setosum no fueron identificadas como taxones hermanos

    Genome-wide signatures of complex introgression and adaptive evolution in the big cats.

    Get PDF
    The great cats of the genus Panthera comprise a recent radiation whose evolutionary history is poorly understood. Their rapid diversification poses challenges to resolving their phylogeny while offering opportunities to investigate the historical dynamics of adaptive divergence. We report the sequence, de novo assembly, and annotation of the jaguar (Panthera onca) genome, a novel genome sequence for the leopard (Panthera pardus), and comparative analyses encompassing all living Panthera species. Demographic reconstructions indicated that all of these species have experienced variable episodes of population decline during the Pleistocene, ultimately leading to small effective sizes in present-day genomes. We observed pervasive genealogical discordance across Panthera genomes, caused by both incomplete lineage sorting and complex patterns of historical interspecific hybridization. We identified multiple signatures of species-specific positive selection, affecting genes involved in craniofacial and limb development, protein metabolism, hypoxia, reproduction, pigmentation, and sensory perception. There was remarkable concordance in pathways enriched in genomic segments implicated in interspecies introgression and in positive selection, suggesting that these processes were connected. We tested this hypothesis by developing exome capture probes targeting ~19,000 Panthera genes and applying them to 30 wild-caught jaguars. We found at least two genes (DOCK3 and COL4A5, both related to optic nerve development) bearing significant signatures of interspecies introgression and within-species positive selection. These findings indicate that post-speciation admixture has contributed genetic material that facilitated the adaptive evolution of big cat lineages

    White Shark Genome Reveals Ancient Elasmobranch Adaptations Associated with Wound Healing and the Maintenance of Genome Stability

    Get PDF
    The white shark (Carcharodon carcharias; Chondrichthyes, Elasmobranchii) is one of the most publicly recognized marine animals. Here we report the genome sequence of the white shark and comparative evolutionary genomic analyses to the chondrichthyans, whale shark (Elasmobranchii) and elephant shark (Holocephali), as well as various vertebrates. The 4.63-Gbp white shark genome contains 24,520 predicted genes, and has a repeat content of 58.5%. We provide evidence for a history of positive selection and gene-content enrichments regarding important genome stability-related genes and functional categories, particularly so for the two elasmobranchs. We hypothesize that the molecular adaptive emphasis on genome stability in white and whale sharks may reflect the combined selective pressure of large genome sizes, high repeat content, high long-interspersed element retrotransposon representation, large body size, and long lifespans, represented across these two species. Molecular adaptation for wound healing was also evident, with positive selection in key genes involved in the wound-healing process, as well as Gene Ontology enrichments in fundamental wound-healing pathways. Sharks, particularly apex predators such as the white shark, are believed to have an acute sense of smell. However, we found very few olfactory receptor genes, very few trace amine-associated receptors, and extremely low numbers of G protein-coupled receptors. We did however, identify 13 copies of vomeronasal type 2 (V2R) genes in white shark and 10 in whale shark; this, combined with the over 30 V2Rs reported previously for elephant shark, suggests this gene family may underlie the keen odorant reception of chondrichthyans

    Genomic Legacy of the African Cheetah, Acinonyx jubatus

    Get PDF
    Background Patterns of genetic and genomic variance are informative in inferring population history for human, model species and endangered populations. Results Here the genome sequence of wild-born African cheetahs reveals extreme genomic depletion in SNV incidence, SNV density, SNVs of coding genes, MHC class I and II genes, and mitochondrial DNA SNVs. Cheetah genomes are on average 95 % homozygous compared to the genomes of the outbred domestic cat (24.08 % homozygous), Virunga Mountain Gorilla (78.12 %), inbred Abyssinian cat (62.63 %), Tasmanian devil, domestic dog and other mammalian species. Demographic estimators impute two ancestral population bottlenecks: one \u3e100,000 years ago coincident with cheetah migrations out of the Americas and into Eurasia and Africa, and a second 11,084–12,589 years ago in Africa coincident with late Pleistocene large mammal extinctions. MHC class I gene loss and dramatic reduction in functional diversity of MHC genes would explain why cheetahs ablate skin graft rejection among unrelated individuals. Significant excess of non-synonymous mutations in AKAP4 (p\u3c0.02), a gene mediating spermatozoon development, indicates cheetah fixation of five function-damaging amino acid variants distinct from AKAP4 homologues of other Felidae or mammals; AKAP4 dysfunction may cause the cheetah’s extremely high (\u3e80 %) pleiomorphic sperm. Conclusions The study provides an unprecedented genomic perspective for the rare cheetah, with potential relevance to the species’ natural history, physiological adaptations and unique reproductive disposition

    Pangolin genomes and the evolution of mammalian scales and immunity

    Get PDF
    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity.University of Malaya and Ministry of Education, Malaysia [UM.C/HIR/MOHE/08]; UMRG grant from the University of Malaya and Ministry of Education, Malaysia [RG541-13HTM]; Russian Ministry of Science [11.G34.31.0068]; NIH-NHGRI grant [5U54HG00307907]SCI(E)[email protected]

    Chromosomal-level assembly of the Asian Seabass genome using long sequence reads and multi-layered scaffolding

    Get PDF
    We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics
    corecore